Computer and Information Science by Roger Lee

Computer and Information Science by Roger Lee

Author:Roger Lee
Language: eng
Format: epub, pdf
Publisher: Springer International Publishing, Cham


3 Methodology

3.1 Data Selection

The research utilized a publicly available dataset containing 10, 000 births recorded at the North Carolina State Centre for Health Statistics in 2006. The target variable was the birth weight group (grams), and it had 10 classes: 0 (500 or less), 1 (501–1000), 2 (1001–1500), 3 (1501–2000), 4 (2001–2500), 5 (2501–3000), 6 (3001–3500), 7 (3501–4000), 8 (4001–4500), and 9 (4501 or more). Initially, the dataset contained 131 predictor variables. In this section, relevant variables for model building were selected. The data file description was used to gain understanding of the variables. Anomalies and redundant variables were eliminated. The final number of attributes for initial modeling was 90 and was exported to Waikato Environment for Knowledge Analysis (WEKA) workbench for preprocessing.



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.